SEQUENCE LISTING 



<110> Kordyum, Vitally A. 

Chernykh, Svitlana I, 
Slavchenko, Iryna Yu . 
Vozianov, Oleksandr 

<120> PHAGE-DEPENDENT SUPER PRODUCTION OF 
BIOLOGICALLY ACTIVE PROTEIN AND PEPTIDES 



<130> PHAGE. 006A 



<150> 09/318,288 
<151> 1999-05-25 



<160> 11 



<170> FastSEQ for Windows Version 4.0 



<210> 1 
<211> 630 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> This sequence was chemically synthesized based 
upon the amino acid sequence of human acidic 
fibroblast growth factor (155 amino acids) using 
codons which are used in highly expressed proteins 
from E. coll. 



<221> CDS 

<222> (122) . (590) 



<400> 1 

gcgtagagga tcgagatctc gatcccgcga aattaatacg actcactata ggggaattgt 60 
gagcggataa caattcccct ctagaaataa ttttgtttaa ctttaagaag gagatataca 120 
t atg get gaa ggg gaa ate acc acc ttt aca gcg tta acg gag aaa ttt 169 
Met Ala Glu Gly Glu lie Thr Thr Phe Thr Ala Leu Thr Glu Lys Phe 
15 10 15 



aac ctt ccg ccc ggg aat tac aaa aaa ccc aag ctt ctt tac tgc agt 
Asn Leu Pro Pro Gly Asn Tyr Lys Lys Pro Lys Leu Leu Tyr Cys Ser 
20 25 30 



217 



aac gga gga cac ttc ctg cga att ctg cca gat ggc aca gta gat ggg 
Asn Gly Gly His Phe Leu Arg He Leu Pro Asp Gly Thr Val Asp Gly 
35 40 45 



265 



act cgc gat cgc tec gac cag cac att cag ctg caa etc teg gcc gaa 
Thr Arg Asp Arg Ser Asp Gin His lie Gin Leu Gin Leu Ser Ala Glu 
50 55 60 



313 



age gtt gga gag gtc tat ate aag teg acg gag act ggc cag tac ctt 
Ser Val Gly Glu Val Tyr He Lys Ser Thr Glu Thr Gly Gin Tyr Leu 



361 



27 



65 



70 



75 



80 



gcc atg gac acc gat ggg ctt ctg tat ggc tea cag acg cct aac gaa 409 
Ala Met Asp Thr Asp Gly Leu Leu Tyr Gly Ser Gin Thr Pro Asn Glu 
85 90 95 

gaa tgc ttg ttt eta gaa aga eta gaa gaa aac cat tac aac acg tac 457 
Glu Cys Leu Phe Leu Glu Arg Leu Glu Glu Asn His Tyr Asn Thr Tyr 
100 105 110 

ata teg aaa aaa cat gca gag aag aac tgg ttt gta ggc ctt aaa aaa 505 
lie Ser Lys Lys His Ala Glu Lys Asn Trp Phe Val Gly Leu Lys Lys 
115 120 125 

aat ggt tec tgt aag cgt gga cca egg act cac tat ggc caa aag get 553 
Asn Gly Ser Cys Lys Arg Gly Pro Arg Thr His Tyr Gly Gin Lys Ala 
130 135 140 

ate ttg ttc ctg cca eta cca gtg age tec gae taa g gatecgaatt 600 
lie Leu Phe Leu Pro Leu Pro Val Ser Ser Asp * 
145 150 155 

cgagctccgt cgacaagett gcggccgcac 630 

<210> 2 
<211> 155 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Met Ala Glu Gly Glu He Thr Thr Phe Thr Ala Leu Thr Glu Lys Phe 

15 10 15 

Asn Leu Pro Pro Gly Asn Tyr Lys Lys Pro Lys Leu Leu Tyr Cys Ser 

20 25 30 

Asn Gly Gly His Phe Leu Arg He Leu Pro Asp Gly Thr Val Asp Gly 

35 40 45 

Thr Arg Asp Arg Ser Asp Gin His He Gin Leu Gin Leu Ser Ala Glu 

50 55 60 

Ser Val Gly Glu Val Tyr He Lys Ser Thr Glu Thr Gly Gin Tyr Leu 
65 70 75 80 

Ala Met Asp Thr Asp Gly Leu Leu Tyr Gly Ser Gin Thr Pro Asn Glu 

85 90 95 

Glu Cys Leu Phe Leu Glu Arg Leu Glu Glu Asn His Tyr Asn Thr Tyr 

100 105 110 

He Ser Lys Lys His Ala Glu Lys Asn Trp Phe Val Gly Leu Lys Lys 

115 120 125 

Asn Gly Ser Cys Lys Arg Gly Pro Arg Thr His Tyr Gly Gin Lys Ala 

130 135 140 

He Leu Phe Leu Pro Leu Pro Val Ser Ser Asp 
145 150 155 



<210> 3 

<211> 468 

<212> DNA 

<213> Homo sapiens 



28 



<400> 3 

atggctgaag 

gggaattaca 

ctt ccggatg 

ctcagtgcgg 

gccatggaca 

ctggaaaggc 

aat tggtttg 

ggccagaaag 



gggaaatcac 
agaagcccaa 
gcacagtgga 
aaagcgtggg 
ccgacgggct 
tggaggagaa 
ttggcctcaa 
caatcttgtt 



caccttcaca 
actcctctac 
tgggacaagg 
ggaggtgtat 
tttatacggc 
ccattacaac 
gaagaatggg 
tctccccctg 



gccctgaccg 
tgtagcaacg 
gacaggagcg 
ataaagagta 
tcacagacac 
acctatatat 
agctgcaaac 
ccagtctctt 



agaagtttaa 
ggggccactt 
accagcacat 
ccgagactgg 
caaatgagga 
ccaagaagca 
gcggtcctcg 
ctgattaa 



tctgcctcca 60 
cctgaggatc 120 
tcagctgcag 180 
ccagtacttg 240 
atgtttgttc 300 
tgcagagaag 360 
gactcactat 420 
468 



<210> 4 
<211> 630 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> This sequence is a chemically synthesized sequence 
encoding a 134 amino acid form of fibroblast 
growth factor with alterations for preferred codon 
usage in E. coli 



<221> CDS 
<222> (122) 



[526} 



60 



<400> 4 

gcgtagagga tcgagatctc gatcccgcga aattaatacg actcactata ggggaattgt 
gagcggataa caattcccct ctagaaataa ttttgtttaa ctttaagaag gagatataca 120 
t atg aat tac aaa aaa ccc aag ctt ctt tac tgc agt aac gga gga cac 169 
Met Asn Tyr Lys Lys Pro Lys Leu Leu Tyr Cys Ser Asn Gly Giy His 
15 10 15 



ttc ctg cga att ctg cca gat ggc aca gta gat ggg act cgc gat cgc 
Phe Leu Arg lie Leu Pro Asp Gly Thr Val Asp Gly Thr Arg Asp Arg 
20 25 30 



217 



tec gac cag cac att cag ctg caa etc teg gcc gaa age gtt gga gag 
Ser Asp Gin His lie Gin Leu Gin Leu Ser Ala Glu Ser Val Gly Glu 
35 40 45 



265 



gtc tat ate aag teg acg gag act ggc cag tac ctt gcc atg gac acc 
Val Tyr lie Lys Ser Thr Glu Thr Gly Gin Tyr Leu Ala Met Asp Thr 
50 55 60 



313 



gat ggg ctt ctg tat ggc tea cag acg cct aac gaa gaa tgc ttg ttt 
Asp Gly Leu Leu Tyr Gly Ser Gin Thr Pro Asn Glu Glu Cys Leu Phe 
65 70 75 80 



361 



eta gaa aga eta gaa gaa aac cat tac aac acg tac ata teg aaa aaa 
Leu Glu Arg Leu Glu Glu Asn His Tyr Asn Thr Tyr lie Ser Lys Lys 
85 90 95 



409 



cat gca gag aag aac tgg ttt gta ggc ctt aaa aaa aat ggt tec tgt 457 
His Ala Glu Lys Asn Trp Phe Val Gly Leu Lys Lys Asn Gly Ser Cys 
100 105 110 

aag cgt gga cca egg act cac tat ggc caa aag get ate ttg ttc ctg 505 
Lys Arg Gly Pro Arg Thr His Tyr Gly Gin Lys Ala lie Leu Phe Leu 



29 



115 



120 



125 



cca eta cca gtg age tec gac taaggatccg aattcgagct ccgtcgacaa 556 
Pro Leu Pro Val Ser Ser Asp 
130 135 

gcttgcggcc gcactcgagc accaccacca ccaccactga gatccggctg ctaacaaagc 616 
ccgaaaggaa gctg 630 

<210> 5 
<211> 135 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> Translated protein sequence for the chemically 
synthesized 134 amino acid form of fibroblast 
growth factor 

<400> 5 



Met 


Asn 


Tyr 


Lys 


Lys 


Pro 


Lys 


Leu 


Leu 


Tyr 


Cys 


Ser 


Asn 


Gly 


Gly 


His 


1 








5 










10 










15 




Phe 


Leu 


Arg 


He 
20 


Leu 


Pro 


Asp 


Gly 


Thr 
25 


Val 


Asp 


Gly 


Thr 


Arg 
30 


Asp 


Arg 


Ser 


Asp 


Gin 
35 


His 


He 


Gin 


Leu 


Gin 
40 


Leu 


Ser 


Ala 


Glu 


Ser 
45 


Val 


Gly 


Glu 


Val 


Tyr 
50 


He 


Lys 


Ser 


Thr 


Glu 
55 


Thr 


Gly 


Gin 


Tyr 


Leu 
60 


Ala 


Met 


Asp 


Thr 


Asp 


Gly 


Leu 


Leu 


Tyr 


Gly 


Ser 


Gin 


Thr 


Pro 


Asn 


Glu 


Glu 


Cys 


Leu 


Phe 


65 










70 










75 










80 


Leu 


Glu 


Arg 


Leu 


Glu 
85 


Glu 


Asn 


His 


Tyr 


Asn 
90 


Thr 


Tyr 


He 


Ser 


Lys 
95 


Lys 


His 


Ala 


Glu 


Lys 
100 


Asn 


Trp 


Phe 


Val 


Gly 
105 


Leu 


Lys 


Lys 


Asn 


Gly 
110 


Ser 


Cys 


Lys 


Arg 


Gly 
115 


Pro 


Arg 


Thr 


His 


Tyr 
120 


Gly 


Gin 


Lys 


Ala 


He 
125 


Leu 


Phe 


Leu 


Pro 


Leu 
130 


Pro 


Val 


Ser 


Ser 


Asp 
135 





















<210> 6 
<211> 630 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> This sequence is a chemically synthesized sequence 
encoding a 14 0 amino acid form of fibroblast 
growth factor with alterations for preferred codon 
usage in E. coli 

<221> CDS 

<222> (122) . . . (544) 
<400> 6 

gcgtagagga tcgagatctc gatcccgcga aattaatacg actcactata ggggaattgt 60 
gagcggataa caattcccct ctagaaataa ttttgtttaa ctttaagaag gagatataca 120 



30 



t atg ttt aac ctt ccg ccc ggg aat tac aaa aaa ccc aag ctt ctt tac 169 
Met Phe Asn Leu Pro Pro Gly Asn Tyr Lys Lys Pro Lys Leu Leu Tyr 
15 10 15 

tgc agt aac gga gga cac ttc ctg cga att ctg cca gat ggc aca gta 217 
Cys Ser Asn Gly Gly His Phe Leu Arg He Leu Pro Asp Gly Thr Val 
20 25 30 

gat ggg act cgc gat cgc tec gac cag cac att cag ctg caa etc teg 265 
Asp Gly Thr Arg Asp Arg Ser Asp Gin His He Gin Leu Gin Leu Ser 
35 40 45 

gcc gaa age gtt gga gag gtc tat ate aag teg acg gag act ggc cag 313 
Ala Glu Ser Val Gly Glu Val Tyr He Lys Ser Thr Glu Thr Gly Gin 
50 55 60 

tac ctt gcc atg gac acc gat ggg ctt ctg tat ggc tea cag acg cct 361 
Tyr Leu Ala Met Asp Thr Asp Gly Leu Leu Tyr Gly Ser Gin Thr Pro 
65 70 75 80 

aac gaa gaa tgc ttg ttt eta gaa aga eta gaa gaa aac cat tac aac 409 
Asn Glu Glu Cys Leu Phe Leu Glu Arg Leu Glu Glu Asn His Tyr Asn 
85 90 95 

acg tac ata teg aaa aaa cat gca gag aag aac tgg ttt gta ggc ctt 457 
Thr Tyr lie Ser Lys Lys His Ala Glu Lys Asn Trp Phe Val Gly Leu 
100 105 110 

aaa aaa aat ggt tec tgt aag cgt gga cca egg act cac tat ggc caa 505 
Lys Lys Asn Gly Ser Cys Lys Arg Gly Pro Arg Thr His Tyr Gly Gin 
115 120 125 

aag get ate ttg ttc ctg cca eta cca gtg age tec gac taaggatccg 554 
Lys Ala He Leu Phe Leu Pro Leu Pro Val Ser Ser Asp 
130 135 140 

aattcgagct ccgtcgacaa gcttgcggcc gcactcgagc accaccacca ccaccactga 614 
gatccggctg ctaaca 630 

<210> 7 
<211> 141 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> Translated protein sequence for the chemically 
synthesized 140 amino acid form of fibroblast 
growth factor 

<400> 7 

Met Phe Asn Leu Pro Pro Gly Asn Tyr Lys Lys Pro Lys Leu Leu Tyr 

15 10 15 

Cys Ser Asn Gly Gly His Phe Leu Arg He Leu Pro Asp Gly Thr Val 

20 25 30 

Asp Gly Thr Arg Asp Arg Ser Asp Gin His He Gin Leu Gin Leu Ser 

35 40 45 

Ala Glu Ser Val Gly Glu Val Tyr He Lys Ser Thr Glu Thr Gly Gin 



31 



50 

Tyr Leu Ala Met Asp 
65 

Asn Glu Glu Cys Leu 
85 

Thr Tyr He Ser Lys 

100 

Lys Lys Asn Gly Ser 
115 

Lys Ala He Leu Phe 
130 



55 

Thr Asp Gly Leu Leu Tyr 
70 75 
Phe Leu Glu Arg Leu Glu 
90 

Lys His Ala Glu Lys Asn 
105 

Cys Lys Arg Gly Pro Arg 
120 

Leu Pro Leu Pro Val Ser 
135 



60 

Gly Ser Gin Thr Pro 
80 

Glu Asn His Tyr Asn 
95 

Trp Phe Val Gly Leu 
110 

Thr His Tyr Gly Gin 

125 
Ser Asp 
140 



<210> 8 
<211> 1822 
<212> DNA 

<213> Artificial Sequence 



<220> 
<221> 
<222> 


TATA_s 


ignal 
M 07 ^ 


<221> 
<222> 


CDS 
(193) . 


, . (202) 


<221> 
<222> 


intron 
(203) . 


. . (458) 


<221> 
<222> 


CDS 
(459) . 


. . (619) 


<221> 
<222> 


intron 
(620) . 


. . (828) 


<221> 
<222> 


CDS 
(829) . 


. . (948) 


<221> 
<222> 


intron 
(949) . 


. . (1041) 


<221> 
<222> 


CDS 
(1042) 


. , . (1206) 


<221> 
<222> 


intron 
(1207) 


. . . (1459) 


<221> 
<222> 


CDS 
(1460) 


. . . (1654) 



<223> Chemically synthesized sequence for Human Growth 
Hormone using codons preferred for expression in 
E. coli 

<400> 8 

ggagcttcta aattatccat tagcacaagc ccgtcagtgg ccccatgcat aaatgtacac 60 

agaaacaggt gggggcaaca gtgggagaga aggggccagg gtataaaaag ggcccacaag 120 

agaccggctc aaggatccca aggcccaact ccccgaacca ctcagggtcc tgtggacgct 180 



32 



cacctagctg ca atg get aca g gtaagcgccc ctaaaatccc tttgggcaca 
Met Ala Thr 
1 



232 



atgtgtcctg aggggagagg cagcgacctg tagatgggac gggggcacta accctcaggt 292 

ttggggcttc tgaatgagta tcgccatgta agcccagtat ggccaatctc agaaagctcc 352 

tggtccctgg agggatggag agagaaaaac aaacagctcc tggagcaggg agagtgctgg 412 

cctcttgctc tccggctccc tctgttgccc tctggtttct ccccag gc tec egg aeg 469 

Gly Ser Arg Thr 
5 



tec ctg etc ctg get ttt ggc ctg etc tgc ctg ccc tgg ett eaa gag 
Ser Leu Leu Leu ^la Phe Gly Leu Leu Cys Leu Pro Trp Leu Gin Glu 
10 15 20 



517 



ggc agt gee ttc cca ace att ccc tta tec agg ett ttt gae aac get 
Gly Ser Ala Phe Pro Thr He Pro Leu Ser Arg Leu Phe Asp Asn Ala 
25 30 35 



565 



atg etc ege gcc cat cgt ctg eac cag ctg gee ttt gac acc tac cag 
Met Leu Arg Ala His Arg Leu His Gin Leu Ala Phe Asp Thr Tyr Gin 
40 45 50 55 



613 



gag ttt gtaagetctt ggggaatggg tgcgeatcag gggtggcagg aaggggtgac 
Glu Phe 



669 



tttcccccgc tgggaaataa gaggaggaga ctaaggagct cagggttttt eccgaagega 729 
aaatgeaggc agatgagcac acgctgagtg aggtteccag aaaagtaaca atgggagctg 789 
gtctccagcg tagaccttgg tgggcggtcc ttetectag gaa gaa gee tat ate 843 

Glu Glu Ala Tyr He 
60 

cea aag gaa cag aag tat tea ttc ctg cag aac ccc cag ace tec etc 891 
Pro Lys Glu Gin Lys Tyr Ser Phe Leu Gin Asn Pro Gin Thr Ser Leu 
65 70 75 

tgt ttc tea gag tct att ceg aca ccc tee aac agg gag gaa aca caa 939 
Cys Phe Ser Glu Ser He Pro Thr Pro Ser Asn Arg Glu Glu Thr Gin 
80 85 90 

cag aaa tee gtgagtggat gcettgaccc caggcgggga tgggggagac 988 
Gin Lys Ser 
95 

ctgtagtcag agcccccggg cagcacaggc caatgcccgt ccttcccctg cag aac 1044 

Asn 



eta gag ctg etc cgc ate tec ctg ctg etc ate cag teg tgg ctg gag 
Leu Glu Leu Leu Arg He Ser Leu Leu Leu He Gin Ser Trp Leu Glu 
100 105 110 



1092 



ccc gtg cag ttc etc agg agt gtc ttc gcc aac age ctg gtg tac ggc 
Pro Val Gin Phe Leu Arg Ser Val Phe Ala Asn Ser Leu Val Tyr Gly 
115 120 125 130 



1140 



33 



gcc tct gac age aac gtc tat gac etc eta aag gac eta gag gaa ggc 
Ala Ser Asp Ser Asn Val Tyr Asp Leu Leu Lys Asp Leu Glu Glu Gly 
135 140 145 



1188 



ate caa acg ctg atg ggg gtgggggtgg cgctaggggt ccccaatctt 1236 
lie Gin Thr Leu Met Gly 
150 

ggagceecae tgactttgag agctgtgtta gagaaacact gctgccctct ttttagcagt 1296 
ccaggccctg acccaagaga actcacctta ttcttcattt cccctcgtga atcctctagc 1356 
ctttctctac accctgaagg ggagggagga aaatgaatga atgagaaagg gagggagcag 1416 
tacccaagcg cttggectct ccttctcttc cttcactttg cag agg ctg gaa gat 1471 

Arg Leu Glu Asp 
155 



gge age cec egg act ggg cag ate ttc aag cag ace tac age aag ttc 1519 
Gly Ser Pro Arg Thr Gly Gin He Phe Lys Gin Thr Tyr Ser Lys Phe 
160 165 170 

gae aca aac tea eac aac gat gac gca eta etc aag aac tac ggg ctg 1567 
Asp Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr Gly Leu 
175 180 185 

etc tac tgc ttc agg aag gac atg gac aag gtc gag aca ttc ctg cgc 1615 
Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe Leu Arg 
190 195 200 

ate gtg cag tgc cgc tct gtg gag ggc age tgt ggc ttc tagctgcccg 
He Val Gin Cys Arg Ser Val Glu Gly Ser Cys Gly Phe 
205 210 215 

ggtggcatcc ctgtgacccc tccccagtgc ctctcctggc cttggaagtt gccactccag 1724 
tgcccaccag ccttgtccta ataaaattaa gttgcatcat tttgtctgac taggtgtcct 1784 
ctataatatt atggggtgga ggggggtggt ttggagca 1822 



1664 



<210> 9 
<211> 217 
<212> PRT 

<213> Homo sapiens 



<400> 9 



Met 


Ala 


Thr 


Gly 


Ser 


Arg 


Thr 


Ser 


Leu 


Leu 


Leu 


Ala 


Phe 


Gly 


Leu 


Leu 


1 






5 










10 










15 




Cys 


Leu 


Pro 


Trp 


Leu 


Gin 


Glu 


Gly 


Ser 


Ala 


Phe 


Pro 


Thr 


He 


Pro 


Leu 






20 










25 










30 






Ser 


Arg 


Leu 


Phe 


Asp 


Asn 


Ala 


Met 


Leu 


Arg 


Ala 


His 


Arg 


Leu 


His 


Gin 




35 










40 










45 








Leu 


Ala 
50 


Phe 


Asp 


Thr 


Tyr 


Gin 
55 


Glu 


Phe 


Glu 


Glu 


Ala 
60 


Tyr 


He 


Pro 


Lys 


Glu 


Gin 


Lys 


Tyr 


Ser 


Phe 


Leu 


Gin 


Asn 


Pro 


Gin 


Thr 


Ser 


Leu 


Cys 


Phe 


65 






70 










75 










80 


Ser 


Glu 


Ser 


He 


Pro 

85 


Thr 


Pro 


Ser 


Asn 


Arg 
90 


•Glu 


Glu 


Thr 


Gin 


Gin 
95 


Lys 


Ser 


Asn 


Leu 


Glu 
100 


Leu 


Leu 


Arg 


He 


Ser 

105 


Leu 


Leu 


Leu 


He 


Gin 
110 


Ser 


Trp 


Leu 


Glu 


Pro 
115 


Val 


Gin 


Phe 


Leu 


Arg 
120 


Ser 


Val 


Phe 


Ala 


Asn 
125 


Ser 


Leu 


Val 



34 



■ 1' ir iipnm|ii 



Tyr 


Gly Ala 


Ser 


Asp 


Ser 


Asn 


Val 


Tyr 


Asp 


Leu 


Leu 


Lys 


Asp 


Leu 


Glu 




130 








135 










140 










Glu 


Gly lie 


Gin 


Thr 


Leu 


Met 


Gly 


Arg 


Leu 


Glu 


Asp 


Gly 


Ser 


Pro 


Arg 


145 








150 










155 










160 


Thr 


Gly Gin 


He 


Phe 


Lys 


Gin 


Thr 


Tyr 


Ser 


Lys 


Phe 


Asp 


Thr 


Asn 


Ser 








165 










170 










175 




His 


Asn Asp 


Asp 


Ala 


Leu 


Leu 


Lys 


Asn 


Tyr 


Gly 


Leu 


Leu 


Tyr 


Cys 


Phe 






180 










185 










190 






Arg 


Lys Asp 


Met 


Asp 


Lys 


Val 


Glu 


Thr 


Phe 


Leu 


Arg 


He 


Val 


Gin 


Cys 




195 










200 










205 








Arg 


Ser Val 


Glu 


Gly 


Ser 


Cys 


Gly 


Phe 

















210 215 



<210> 10 
<211> 990 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Chemically synthesized sequence for human 
interferon alpha- 2b 

<221> promoter 
<222> (231) . . . (249) 



<221> CDS 
<222> (320) 



. (784) 



<400> 10 

gggcgctgac 

tgttgctcag 

tgattcattc 

gagcacgatc 

tcactatagg 

ttaagaagga 



ttccgcgttt ccagacttta 
gtcgcagacg ttttgcagca 
tgctaaccag taaggcaacc 
atgcgcaccc gtggggccgc 
ggaattgtga gcggataaca 
gatatacat atg get gaa 
Met Ala Glu 
1 



cgaaacacgg aaaccgaaga ccattcatgt 60 
gcagtcgctt cacgttcgct cgcgtatcgg 120 
ccgccagcct agccgggtcc tcaacgacag 180 
cagatctcga tcccgcgaaa ttaatacgac 240 
attcccctct agaaataatt ttgtttaact 300 
ggg gaa ate acc acc ttt aea gcg 352 
Gly Glu He Thr Thr Phe Thr Ala 
5 10 



tta acg gag aaa ttt aac ctt ccg ccc ggg aat tac aaa aaa ccc aag 
Leu Thr Glu Lys Phe Asn Leu Pro Pro Gly Asn Tyr Lys Lys Pro Lys 
15 20 25 



100 



ctt ctt tac tgc agt aac gga gga cac ttc ctg cga att ctg cca gat 
Leu Leu Tyr Cys Ser Asn Gly Gly His Phe Leu Arg He Leu Pro Asp 
30 35 40 



448 



ggc aca gta gat ggg act cgc gat cgc tec gac cag cac att cag ctg 
Gly Thr Val Asp Gly Thr Arg Asp Arg Ser Asp Gin His He Gin Leu 
45 50 55 



496 



caa etc teg gee gaa age gtt gga gag gte tat ate aag teg acg gag 
Gin Leu Ser Ala Glu Ser Val Gly Glu Val Tyr He Lys Ser Thr Glu 
60 65 70 75 



544 



act ggc cag tac ctt gee atg gac acc gat ggg ctt ctg tat ggc tea 
Thr Gly Gin Tyr Leu Ala Met Asp Thr Asp Gly Leu Leu Tyr Gly Ser 



592 



35 



80 



85 



90 



cag acg cct aac gaa gaa tgc ttg ttt eta gaa aga eta gaa gaa aac 640 
Gin Thr Pro Asn Glu Glu Cys Leu Phe Leu Glu Arg Leu Glu Glu Asn 
95 100 105 

cat tac aac acg tac ata teg aaa aaa cat gca gag aag aac tgg ttt 688 
His Tyr Asn Thr Tyr lie Ser Lys Lys His Ala Glu Lys Asn Trp Phe 
110 115 120 

gta ggc ctt aaa aaa aat ggt tec tgt aag egt gga cca egg act cac 736 
Val Gly Leu Lys Lys Asn Gly Ser Cys Lys Arg Gly Pro Arg Thr His 
125 130 135 

tat ggc eaa aag get ate ttg ttc ctg cca eta cca gtg age tec gac 784 
Tyr Gly Gin Lys Ala lie Leu Phe Leu Pro Leu Pro Val Ser Ser Asp 
140 145 150 155 

taaggatceg aattcgagct ecgtcgacaa gcttgcggcc gcactcgage aceaccacca 844 

ceaceaetga gatccggetg ctaacaaagc ccgaaaggaa gctgagttgg ctgctgccac 904 

egctgagcaa taaetagcat aaccccttgg ggcctctaaa cgggtcttga ggggtttttt 964 

getgaaagga ggaactatat ccggat 990 

<210> 11 
<211> 155 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> Translated protein sequence for the chemically 
synthesized human interferon alpha-2b 

<4Q0> 11 

Met Ala Glu Gly Glu He Thr Thr Phe Thr Ala Leu Thr Glu Lys Phe 

15 10 15 

Asn Leu Pro Pro Gly Asn Tyr Lys Lys Pro Lys Leu Leu Tyr Cys Ser 

20 25 30 

Asn Gly Gly His Phe Leu Arg He Leu Pro Asp Gly Thr Val Asp Gly 

35 40 45 

Thr Arg Asp Arg Ser Asp Gin His He Gin Leu Gin Leu Ser Ala Glu 

50 55 60 

Ser Val Gly Glu Val Tyr He Lys Ser Thr Glu Thr Gly Gin Tyr Leu 
65 70 75 80 

Ala Met Asp Thr Asp Gly Leu Leu Tyr Gly Ser Gin Thr Pro Asn Glu 

85 90 95 

Glu Cys Leu Phe Leu Glu Arg Leu Glu Glu Asn His Tyr Asn Thr Tyr 

100 105 110 

He Ser Lys Lys His Ala Glu Lys Asn Trp Phe Val Gly Leu Lys Lys 

115 120 125 

Asn Gly Ser Cys Lys Arg Gly Pro Arg Thr His Tyr Gly Gin Lys Ala 

130 135 140 

He Leu Phe Leu Pro Leu Pro Val Ser Ser Asp 
145 150 155 
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